Abstract

In a scheduling game, each player owns a job and chooses a machine to execute 
it. While the social cost is the maximal load over all machines (makespan), the cost 
(disutility) of each player is the completion time of its own job. In the game, players 
may follow selfish strategies to optimize their cost and therefore their behaviors do not 
necessarily lead the game to an equilibrium. Even in the case there is an equilibrium, 
its makespan might be much larger than the social optimum, and this inefficiency 
is measured by the price of anarchy - the worst ratio between the makespan of an 
equilibrium and the optimum. Coordination mechanisms aim to reduce the price of 
anarchy by designing scheduling policies that specify how jobs assigned to the same
machine are to be scheduled. Typically these policies define the schedule according 
to the processing times as announced by the jobs. One could wonder if there are 
policies that do not require this knowledge, and still provide a good price of anarchy. 
This would allow the processing times to remain private information and avoid the problem
of truthfulness. In this paper we study these so-called non-clairvoyant policies. In 
particular, we study the RANDOM policy that schedules the jobs in a random order 
without preemption, and the EQUI policy that schedules the jobs in parallel using 
time-multiplexing, assigning each job an equal fraction of CPU time. 

For these models we study two important questions, the existence of Nash equilibria 
and the price of anarchy. We show that the game under the RANDOM policy is a potential
game for uniform machines or for two unrelated machines. However, it is not a potential 
game for three or more unrelated machines. Moreover, we prove that the game under 
the EQUI policy is a potential game. 

Next, we analyze the inefficiency of the EQUI policy. Interestingly, the (strong) price
of anarchy of EQUI, a non-clairvoyant policy, is asymptotically the same as that of the 
best strongly local policy - policies in which a machine may look at the processing time 
of jobs assigned to it. The result also indicates that knowledge of jobs' characteristics 
is not necessarily needed. 
1 Introduction



With the development of the Internet, large-scale autonomous systems became more and 
more important. The systems consist of many independent and selfish agents who com- 
pete for the usage of shared resources. Every configuration has some social cost, as well as 
individual costs for every agent. Due to the lack of coordination, the equilibrium configu- 
rations may have high cost compared to the global social optimum and this inefficiency can 
be captured by the price of anarchy [27]. It is defined as the ratio between the worst-case
performance of a Nash equilibrium [29] and the global optimum. Since the behavior of the
agents is influenced by the individual costs, it is natural to come up with mechanisms that 
both force the existence of Nash equilibria and reduce the price of anarchy. The idea is to 
try to reflect the social cost in the individual costs, so that selfish agents' behaviors result in 
a socially desired solution. In particular we are interested in scheduling games, where every 
player has to choose one machine on which to execute its job. The individual cost of a player 
is the completion time of its job, and the social cost is the largest completion time over all 
jobs, the makespan. For these games, so called coordination mechanisms have been studied 
by Christodoulou et al. [10]. A coordination mechanism is a set of local policies, one for every
machine, that specify a schedule for the jobs assigned to it, and the schedule can depend 
only on these jobs. Most prior studied policies depend on the processing times and need 
the jobs to announce their processing times. The jobs could try to influence the schedule 
to their advantage by announcing incorrect processing times. There are two ways
to deal with this issue. One is to design truthful coordination mechanisms where jobs have 
an incentive to announce their real processing times. Another way is to design mechanisms 
that do not depend on the processing times at all and this is the subject of this paper: we 
study coordination mechanisms based on so-called non-clairvoyant policies that we define in
this section. 

1.1 Preliminaries 

Scheduling The machine scheduling problem is defined as follows: we are given $n$ jobs and
$m$ machines, and each job needs to be scheduled on exactly one machine. In the most
general case machine speeds are unrelated, and for every job $1 \le i \le n$ and every machine
$1 \le j \le m$ we are given an arbitrary processing time $p_{i,j}$, which is the time spent by job $i$
on machine $j$. A schedule $\sigma$ is a function mapping each job to some machine. The load of
a machine $j$ in schedule $\sigma$ is the total processing time of jobs assigned to this machine, i.e.,
$\ell_j = \sum_{i : \sigma(i) = j} p_{i,j}$. The makespan of a schedule is the maximal load over all machines, and is
the social cost of a schedule. It is NP-hard to compute the global optimum even for identical
machines, that is when $p_{i,j}$ does not depend on $j$, see [21, problem SS8]. We denote by OPT
the makespan of the optimal schedule.

Machine environments We consider four different machine environments, which all have
their own justification. The most general environment concerns unrelated machines as
defined above and is denoted $R||C_{\max}$. In the identical machine scheduling model, denoted
$P||C_{\max}$, every job $i$ comes with a length $p_i$ such that $p_{i,j} = p_i$ for every machine $j$. In the
uniform machine scheduling model, denoted $Q||C_{\max}$, again every job has length $p_i$ and every
machine $j$ a speed $s_j$ such that $p_{i,j} = p_i / s_j$. For the restricted identical machine model,
every job $i$ comes with a length $p_i$ and a set of machines $S_i$ on which it can be scheduled, such
that $p_{i,j} = p_i$ for $j \in S_i$ and $p_{i,j} = \infty$ otherwise. This model is denoted $P|MPM|C_{\max}$,
and in [21] it is denoted $B||C_{\max}$.

Scheduling game What we described so far are well known and extensively studied classi- 
cal scheduling problems. But now consider the situation where each of the n jobs is owned by 
an independent agent. In this paper we will sometimes abuse notation and identify the agent 
with his job. The agents do not care about the social optimum, their goal is to complete 
their job as soon as possible. We consider the situation where each agent can freely decide 
on which machine its job is to be scheduled. The actual schedule however is not decided by 
the agents. We rather fix a policy, known to all agents, which specifies the actual schedule, 
once all agents assigned their jobs to machines. Different policies are defined below. 

In the paper, we concentrate on pure strategies where each agent selects a single machine 
to process its job. Such a mapping $\sigma$ is called a strategy profile. Each agent is aware of the
decisions made by other agents and behaves selfishly. The individual cost of a job is defined 
as its completion time. A pure Nash equilibrium is a schedule in which no agent has an 
incentive to unilaterally switch to another machine. In this paper we will simply omit the 
adjective pure, since there is no confusion possible. A strong Nash equilibrium is a schedule 
that is resilient to deviations of any coalition, i.e., no group of agents can cooperate and 
change their strategies in such a way that all players in the group strictly decrease their 
costs, see [2, 16]. For some given strategy profile, a better response move of a job $i$ is a
strategy (machine) $j$ such that if job $i$ changes to machine $j$, while all other players stick to their
strategy, the cost of i decreases strictly. If there is such a move, we say that this job is 
unhappy, otherwise it is happy. In this setting a Nash equilibrium is a strategy profile where 
all jobs are happy. The better-response dynamic is the process of repeatedly choosing an 
arbitrary unhappy job and changing it to an arbitrary better response move. A potential 
game is a game in which for any instance, the better-response dynamic always converges [25].
Such a property is typically shown by the use of a potential function, which maps strategy
profiles to non-negative numerical values. The game is called a strong potential game if there
is a potential function with the property that if an agent improves its individual cost by some
amount $\Delta$, then the potential function decreases by the same amount $\Delta$.

A coordination mechanism is a set of scheduling policies, one for each machine, that
determines how to schedule jobs assigned to a machine [10]. The idea is to connect the individual
cost to the social cost, in such a way that the selfishness of the agents will lead to equilibria
that have low social cost. How good is a given coordination mechanism? This is measured
by the well-known price of anarchy (PoA), see [27]. It is defined as the ratio between the cost
of the worst Nash equilibrium and the optimal cost, which is not an equilibrium in general.
We also consider the strong price of anarchy (SPoA) which is the extension of the price of
anarchy applied to strong Nash equilibria [16].
Figure 1: Different scheduling policies for $p_A = 1$, $p_B = 1$, $p_C = 2$, $p_D = 3$. Ties are broken
arbitrarily between jobs A and B. The rectangles represent the schedules on a single machine
with time going from left to right and the height of a block being the amount of CPU assigned
to the job.



Policies A policy is a rule that specifies how the jobs that are assigned to a machine are
to be scheduled. We now define several policies, and give proper credit to the introducing
papers in Section 1.2.



We distinguish between local, strongly local and non-clairvoyant policies. Let $S_j$ be the
set of jobs assigned to machine $j$. A policy is local if the scheduling of jobs on machine $j$
depends only on the parameters of jobs in $S_j$, i.e., it may look at the processing time $p_{i,k}$
of a job $i \in S_j$ on any machine $k$. A policy is strongly local if it looks only at the processing
time of jobs in $S_j$ on machine $j$. We call a policy non-clairvoyant if the scheduling of jobs on
machine $j$ does not depend on the processing time of any job on any machine. In this paper
we only study coordination mechanisms that use the same policy for all machines, as opposed
to Angel et al. SPT and LPT are policies that schedule the jobs without preemption
in order of increasing and decreasing processing times, respectively, with a deterministic
tie-breaking rule for each machine. An interesting property of SPT is that it minimizes the sum
of the completion times, while LPT has a better price of anarchy, because it incites small jobs
to go to the least loaded machine, which smooths the loads. A policy that relates individual
costs even more strongly to the social cost is MAKESPAN, where jobs are scheduled in parallel on
one machine using time-multiplexing, each job being assigned a fraction of the CPU that is
proportional to its processing time. As a result all jobs complete at the same time, and the
individual cost is the load of the machine. All these policies are deterministic, in the sense
that they map strategy profiles to a determined schedule. This is opposed to randomized
policies, which map strategy profiles to a distribution over schedules.

What could a scheduler do in the non-clairvoyant case? He could either schedule the jobs 
in a random order or in parallel. The RANDOM policy schedules the jobs in a random order 
without preemption. Consider a job $i$ assigned to machine $j$ in the schedule $\sigma$, then the cost
of $i$ under the RANDOM policy is its expected completion time, i.e.,

$$c_i = p_{i,j} + \frac{1}{2} \sum_{i' : \sigma(i') = j,\ i' \neq i} p_{i',j}.$$

In other words the expected completion time of $i$ is half of the total load of the machine, where
job $i$ counts twice. Again, as for MAKESPAN, the individual and social cost in RANDOM are
strongly related, and it is likely that these policies should have the same price of anarchy.
This is indeed the case except for unrelated machines.
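
As a small illustration of the RANDOM cost, the following sketch compares the closed form (own processing time plus half of the other jobs' load on the machine) with a brute-force average over all orderings. The function names and the dictionary `p` (job name to integer processing time on the machine in question) are assumptions made for this example only.

```python
from fractions import Fraction
from itertools import permutations


def random_cost(i, jobs_on_machine, p):
    """Closed form: p[i] plus half of the load of the other jobs on the machine."""
    others = sum(p[k] for k in jobs_on_machine if k != i)
    return Fraction(p[i]) + Fraction(others, 2)


def random_cost_bruteforce(i, jobs_on_machine, p):
    """Average completion time of job i over all orderings of the jobs."""
    total, count = Fraction(0), 0
    for order in permutations(jobs_on_machine):
        t = 0
        for k in order:
            t += p[k]          # job k runs to completion without preemption
            if k == i:
                break          # t is now the completion time of job i
        total += t
        count += 1
    return total / count
```

For four jobs with processing times 1, 1, 2 and 3 on one machine, both functions give 9/2 for the job of length 2, that is, half of the machine load 7 with the job's own length counted twice.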
Another natural non-clairvoyant policy is EQUI, which has been studied for example
in [13] in the different context of online algorithms. Like MAKESPAN, it schedules the jobs in
parallel preemptively using time-multiplexing, but it assigns to every job the same fraction
of the CPU. Suppose there are $k$ jobs with processing times $p_{1,j} \le p_{2,j} \le \dots \le p_{k,j}$ assigned
to machine $j$, where we renumbered the jobs from 1 to $k$ for this example. Since each job receives the
same amount of resource, job 1 is completed at time $c_1 = k p_{1,j}$. At that time, all remaining jobs
have remaining processing times $(p_{2,j} - p_{1,j}) \le (p_{3,j} - p_{1,j}) \le \dots \le (p_{k,j} - p_{1,j})$. Now the
machine splits its resource into $k - 1$ parts until the moment job 2 is completed, which is at time
$k p_{1,j} + (k-1)(p_{2,j} - p_{1,j}) = p_{1,j} + (k-1) p_{2,j}$. In general, the completion time of job $i$, which
is also its cost, under the EQUI policy is given by equation (1).



We already distinguished policies depending on what information is needed from the jobs. 
In addition we distinguish between preemptive and non-preemptive policies, depending on 
the schedule that is produced. Among the policies we considered so far, only MAKESPAN 
and EQUI are preemptive, in the sense that they rely on time-multiplexing, which consists 
in executing arbitrarily small slices of the jobs. Note that EQUI is a realistic and quite
popular policy. It is implemented in many operating systems such as Unix and Windows.
See Figure 1 for an illustration of these five policies.
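
The EQUI completion times can be computed directly from the sorted processing times on a machine: the job of rank $i$ completes at $p_{1,j} + \dots + p_{i-1,j} + (k - i + 1) p_{i,j}$. The sketch below implements this, together with an equivalent form in which job $i$ pays $\min\{p_i, p_{i'}\}$ for every job $i'$ on its machine (itself included). Function names are hypothetical and the input is simply the list of processing times on one machine.

```python
def equi_costs(p):
    """Completion time of every job on one machine under EQUI.

    p[i] is the processing time of job i on this machine. Jobs are handled
    in order of increasing processing time; the job of rank r (0-based)
    finishes at (sum of the r shorter jobs) + (k - r) * p[i].
    """
    k = len(p)
    order = sorted(range(k), key=lambda i: p[i])
    costs = [0] * k
    prefix = 0                      # total length of the jobs already completed
    for rank, i in enumerate(order):
        costs[i] = prefix + (k - rank) * p[i]
        prefix += p[i]
    return costs


def equi_costs_min_form(p):
    """Equivalent form: job i pays min(p[i], p[q]) for every job q on the machine."""
    return [sum(min(pi, q) for q in p) for pi in p]
```

For processing times 1, 1, 2, 3 this gives completion times 4, 4, 6, 7; the longest job finishes exactly at the machine load.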

Example For illustration consider the scheduling game on parallel identical machines and
the EQUI policy. Here each of the $n$ jobs has a processing time $p_i$, for $1 \le i \le n$. Every
agent selects a machine, which is described by a strategy profile $\sigma : \{1, \dots, n\} \to \{1, \dots, m\}$.
Now the individual cost of agent $i$ is the completion time of its job, which for this policy is

$$c_i = \sum_{i'} \min\{p_i, p_{i'}\},$$

where the sum is taken over all jobs $i'$ assigned to the same machine as $i$, i.e. $\sigma(i) = \sigma(i')$.

1.2 Previous and related work

Coordination mechanisms are related to local search algorithms. The local improvement
moves in the local search algorithm correspond to the better-response moves of players in
the game defined by the coordination mechanism. Some results on local search algorithms
for scheduling problems are surveyed in [33].

Most previous work concerned non-preemptive strongly local policies, in particular the
MAKESPAN policy. Czumaj and Vöcking [11] gave tight results $\Theta(\log m / \log\log m)$ for its
price of anarchy for pure Nash equilibria on uniform machines. Fiat et al. [15] extended this
result to the strong price of anarchy, and obtained the tight bound $\Theta(\log m / (\log\log m)^2)$.
In addition, Gairing et al. [20] and Awerbuch et al. gave tight bounds for the price of
anarchy for restricted identical machines.



$$c_i = c_{i-1} + (k - i + 1)(p_{i,j} - p_{i-1,j}) = p_{1,j} + \dots + p_{i-1,j} + (k - i + 1)\, p_{i,j} \qquad (1)$$
Coordination mechanism design was introduced by Christodoulou et al. [10]. They studied
the LPT policy on identical machines. Immorlica et al. [26] studied coordination mechanisms
for all four machine environments and gave a survey on the results for non-preemptive
strongly local policies. They also analyzed the existence of pure Nash equilibria under SPT,
LPT and RANDOM for certain machine environments and the speed of convergence to
equilibrium of the better response dynamics. Precisely, they proved that the game is a potential
game under the policies SPT on unrelated machines, LPT on uniform or restricted identical
machines, and RANDOM on restricted identical machines. In [32] it was shown that the
game does not converge under the LPT policy on unrelated machines. The policy EQUI has
been studied in [13] for its competitive ratio. The results are summarized in Table 1.

Azar et al. [6] introduced the inefficiency-based local policy which has price of anarchy
O(logm) on unrelated machines. Moreover, they also proved that every non-preemptive 
strongly local policy with an additional assumption has price of anarchy at least m/2, which 
shows a sharp difference between strongly local and local policies. 



model \ policy   MAKESPAN                SPT                 LPT                  RANDOM                  EQUI
identical        2 - 2/(m+1)             2 - 1/m [23, 26]    4/3 - 1/(3m)         2 - 2/(m+1)             2 - 1/m
uniform          Θ(log m / log log m)    Θ(log m)            1.52 ≤ PoA ≤ 1.59    Θ(log m / log log m)    Θ(log m)
restricted id.   Θ(log m / log log m)    Θ(log m)            Θ(log m)             Θ(log m / log log m)    Θ(log m)
unrelated        unbounded [31]          Θ(m)                unbounded            Θ(m)                    Θ(m)

Table 1: Price of anarchy under different strongly local and non-clairvoyant policies. The
rightmost column is our contribution.



1.3 Our contribution 

We are interested in admissible non-clairvoyant policies, that is, policies that always induce a Nash
equilibrium for any instance of the game. In the game, maybe more important than the
question of existence of a Nash equilibrium is the question of convergence to an equilibrium. Since
no processing time is known to the coordination mechanism, it is impossible for it to compute an
equilibrium or even decide whether a given assignment of jobs to machines is an equilibrium. On the
other hand, if all processing times are known to all jobs, it makes sense to let the jobs evolve according
to the better-response dynamics, until they eventually reach an equilibrium. Therefore in the
paper, we are interested in the convergence of the better-response dynamic.

In Section 2 we study the existence of Nash equilibrium under the non-clairvoyant policies
RANDOM and EQUI. We show that for the RANDOM policy, the game is a potential game
on uniform machines. We also show that on two unrelated machines, it is a potential game,
but for three unrelated machines or more, the better-response dynamic does not converge.
Moreover, we prove that for the EQUI policy, the game is a (strong) potential game, see
Table 2.



model \ policy   MAKESPAN   SPT        LPT        RANDOM                         EQUI
identical        Yes        Yes [26]   Yes [26]   Yes [26]                       Yes
uniform          Yes        Yes [26]   Yes [26]   Yes                            Yes
restricted id.   Yes        Yes [26]   Yes [26]   Yes [26]                       Yes
unrelated        Yes        Yes [26]   No [32]    Yes for m = 2, No for m ≥ 3    Yes

Table 2: Convergence of the better response dynamic.



In Section 3 we analyze the price of anarchy and the strong price of anarchy of EQUI.
We observe that RANDOM is slightly better than EQUI except for the unrelated model. In
the unrelated model, interestingly, the price of anarchy of EQUI reaches the lower bound
in [6] on the PoA of any strongly local policy with some additional condition. Although
there is a clear difference between strongly local and local policies with
respect to the price of anarchy, our results indicate that, in contrast, restricting strongly local
policies to be non-clairvoyant does not really affect the price of anarchy. Moreover, the EQUI
policy does not need any knowledge about the jobs' characteristics, not even their identities (IDs),
which are useful in designing policies with low price of anarchy in [6, 8].

2 Existence of Nash equilibrium 

The results in this section are summarized as follows. 

Summary of results on the existence of Nash equilibrium: We consider the schedul- 
ing game under different policies in different machine environments. 

1. For the RANDOM policy on uniform machines, it is a potential game. For the RAN- 
DOM policy on unrelated machines, it is not a potential game for 3 or more machines, 
but it is a potential game for 2 machines. 

2. For the EQUI policy it is an exact potential game. 

2.1 The RANDOM policy on uniform machines 

In the RANDOM policy, the cost of a job is its expected completion time. If the load of
machine $j$ is $\ell_j$ then the cost of job $i$ assigned to machine $j$ is $\frac{1}{2}(\ell_j + p_{i,j})$. Observe that a job
$i$ on machine $j$ has an incentive to move to machine $j'$ if and only if $p_{i,j} + \ell_j > 2 p_{i,j'} + \ell_{j'}$.

In this section, we consider uniform machines. Let $p_1 \le p_2 \le \dots \le p_n$ be the job lengths
and $s_1 \ge s_2 \ge \dots \ge s_m$ be the machine speeds. Now the processing time of job $i$ on machine
$j$ is $p_i / s_j$.
Theorem 1 The scheduling game on uniform machines with the RANDOM policy is a
potential game.

Proof: Let $\sigma : \{1, \dots, n\} \to \{1, \dots, m\}$ be a strategy profile. The proof will use a potential
function adapted from previous studies on congestion games [30] and load balancing games
[22, 18]. We define

$$\Phi(\sigma) := \sum_{j=1}^{m} \frac{\ell_j^2}{s_j} + 3 \sum_{i=1}^{n} \frac{p_i^2}{s_{\sigma(i)}},$$

where $\ell_j$ is the load of machine $j$, i.e. the sum of $p_i$ over all jobs $i$ with $\sigma(i) = j$.

Now consider a job $i$ that makes a better response move from machine $a$ to machine $b$.
If $\ell_a, \ell_b$ denote the loads of machines $a$ and $b$, respectively, before the move, then by definition
of a better response move we have the inequality

$$\frac{\ell_a + p_i}{s_a} > \frac{\ell_b + 2 p_i}{s_b}. \qquad (2)$$

Let $\sigma'$ be the profile after the move of job $i$. The change in the potential is

$$\begin{aligned}
\Phi(\sigma') - \Phi(\sigma)
&= \frac{\ell_b^2 + 2 \ell_b p_i + 4 p_i^2 - \ell_b^2}{s_b} + \frac{\ell_a^2 - 2 \ell_a p_i + p_i^2 - \ell_a^2 - 3 p_i^2}{s_a} \\
&= 2 p_i \left( \frac{\ell_b + 2 p_i}{s_b} - \frac{\ell_a + p_i}{s_a} \right) < 0
\end{aligned}$$

due to (2). Therefore, the potential function $\Phi$ strictly decreases at every better response
move. □
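
The argument can be sanity-checked numerically. The sketch below assumes the potential $\Phi(\sigma) = \sum_j \ell_j^2 / s_j + 3 \sum_i p_i^2 / s_{\sigma(i)}$ (the form consistent with the derivation above) and verifies on random instances that every single-job move changes $\Phi$ by exactly $4 p_i$ times the change of the mover's cost, so that $\Phi$ drops on every better response move. All function names, the instance sizes and the integer ranges are arbitrary choices for illustration.

```python
import random
from fractions import Fraction


def loads(sigma, p, m):
    """Load of each machine: sum of the lengths p[i] of the jobs assigned to it."""
    load = [0] * m
    for i, j in enumerate(sigma):
        load[j] += p[i]
    return load


def random_cost(i, sigma, p, s):
    """Expected completion time of job i under RANDOM on uniform machines."""
    j = sigma[i]
    return Fraction(loads(sigma, p, len(s))[j] + p[i], 2 * s[j])


def potential(sigma, p, s):
    """Phi(sigma) = sum_j l_j^2 / s_j + 3 * sum_i p_i^2 / s_sigma(i)."""
    load = loads(sigma, p, len(s))
    machine_part = sum(Fraction(load[j] ** 2, s[j]) for j in range(len(s)))
    job_part = sum(Fraction(3 * p[i] ** 2, s[sigma[i]]) for i in range(len(p)))
    return machine_part + job_part


def check_potential_decreases(trials=200, seed=0):
    """Check all single-job moves on random instances; return the number of improving moves."""
    rng = random.Random(seed)
    improving = 0
    for _ in range(trials):
        n, m = rng.randint(2, 6), rng.randint(2, 4)
        p = [rng.randint(1, 20) for _ in range(n)]
        s = [rng.randint(1, 5) for _ in range(m)]
        sigma = [rng.randrange(m) for _ in range(n)]
        for i in range(n):
            for b in range(m):
                if b == sigma[i]:
                    continue
                new = list(sigma)
                new[i] = b
                gain = random_cost(i, new, p, s) - random_cost(i, sigma, p, s)
                delta = potential(new, p, s) - potential(sigma, p, s)
                assert delta == 4 * p[i] * gain   # exact identity from the proof
                if gain < 0:                      # better response move
                    assert delta < 0
                    improving += 1
    return improving
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the identity can be tested with `==` rather than with a floating-point tolerance.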



2.2 The RANDOM policy for unrelated machines 

In the following, we will characterize the game under the RANDOM policy in the unrelated
model as a function of the number of machines.

Theorem 2 The scheduling game on 2 unrelated machines with the RANDOM policy is a potential
game.

Proof: Let $\sigma : \{1, \dots, n\} \to \{1, 2\}$ be the current strategy profile, meaning that job $i$ is
assigned to machine $\sigma(i)$. By $\bar\sigma(i)$ we denote the opposite machine to machine $\sigma(i)$. Let $\ell_j$
be the load of machine $j$ in strategy profile $\sigma$, which is $\sum_{i : \sigma(i) = j} p_{i,j}$. Define the potential
function as

$$\Phi(\sigma) := (\ell_1 - \ell_2)^2 + 3 \sum_{i=1}^{n} p_{i,\sigma(i)}^2.$$

We claim that the potential function $\Phi$ strictly decreases at every better response move.
Let $i$ be a job moving from say machine $a$ to machine $b$, while strictly decreasing its cost,
i.e.

$$\ell_b + 2 p_{i,b} - \ell_a - p_{i,a} < 0, \qquad (3)$$

where $\ell_a, \ell_b$ are the loads before the move.

Let $\sigma'$ be the strategy profile after the move of job $i$. We have:

$$\begin{aligned}
\Phi(\sigma') - \Phi(\sigma)
&= (\ell_a - p_{i,a} - \ell_b - p_{i,b})^2 - (\ell_a - \ell_b)^2 + 3 (p_{i,b}^2 - p_{i,a}^2) \\
&= -(p_{i,a} + p_{i,b})(2 \ell_a - p_{i,a} - 2 \ell_b - p_{i,b}) + 3 (p_{i,a} + p_{i,b})(p_{i,b} - p_{i,a}) \\
&= (p_{i,a} + p_{i,b}) \left[ 3 (p_{i,b} - p_{i,a}) - (2 \ell_a - p_{i,a} - 2 \ell_b - p_{i,b}) \right] \\
&= 2 (p_{i,a} + p_{i,b}) \left[ (2 p_{i,b} + \ell_b) - (p_{i,a} + \ell_a) \right] < 0
\end{aligned}$$

due to (3). Therefore, the potential function $\Phi$ strictly decreases at every better response
move. □

However, for 3 or more machines, the better-response dynamic does not necessarily con- 
verge. 

Lemma 1 The better-response dynamic does not converge under the RANDOM policy on 3 
or more unrelated machines. 

Proof: We give a simple four-job instance, with the following processing times. For conve- 
nience we name the jobs A, B,C, D. 



p_{i,j}   1      2      3
A         90     84     ∞
B         96     2      ∞
C         138    100    ∞
D         ∞      254    300



Now we describe a cyclic sequence of better response moves, where each job strictly decreases
its cost, showing that the game does not converge. In the following table, we describe in
each line the current strategy profile, a better response move of an unhappy job and its
cost improvement. For example the first line shows the strategy profile where jobs A, B are
on machine 1, C is on machine 2 and D is on machine 3. Then the cost of job A is 138
and by moving to machine 2, its cost drops to 134. The subsequent lines show similar better
response moves, which end in the initial strategy profile.

machine 1   machine 2   machine 3   move        cost improvement
AB          C           D           A: 1 → 2    138 > 134
B           AC          D           B: 1 → 2    96 > 94
            ABC         D           C: 2 → 1    143 > 138
C           AB          D           D: 3 → 2    300 > 297
C           ABD                     B: 2 → 1    171 > 165
BC          AD                      A: 2 → 1    211 > 207
ABC         D                       C: 1 → 2    231 > 227
AB          CD                      D: 2 → 3    304 > 300
AB          C           D

□



Note that although there exists a cycle in the better-response dynamic of the game under
the RANDOM policy, this does not mean that the game possesses no equilibrium, see [28].
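
The cycle of Lemma 1 can be replayed mechanically. The sketch below hard-codes the four-job instance and the eight moves, recomputes each RANDOM cost as half of the machine load with the moving job counted twice, and checks that every move is a strict improvement and that the final profile equals the initial one. Machines are indexed from 0 in the code; the names `P`, `random_cost` and `verify_cycle` are chosen here for illustration.

```python
INF = float("inf")

# Processing times p[job] = (machine 1, machine 2, machine 3) from the instance above.
P = {
    "A": (90, 84, INF),
    "B": (96, 2, INF),
    "C": (138, 100, INF),
    "D": (INF, 254, 300),
}


def random_cost(job, profile):
    """Expected completion time under RANDOM: (machine load + own processing time) / 2."""
    machine = profile[job]
    load = sum(P[k][machine] for k, mk in profile.items() if mk == machine)
    return (load + P[job][machine]) / 2


def verify_cycle():
    """Replay the eight better response moves; return the final profile and the cost trace."""
    profile = {"A": 0, "B": 0, "C": 1, "D": 2}
    moves = [("A", 1), ("B", 1), ("C", 0), ("D", 1),
             ("B", 0), ("A", 0), ("C", 1), ("D", 2)]
    trace = []
    for job, target in moves:
        before = random_cost(job, profile)
        profile = dict(profile)
        profile[job] = target
        after = random_cost(job, profile)
        assert after < before          # each move strictly decreases the mover's cost
        trace.append((job, before, after))
    return profile, trace
```

Calling `verify_cycle()` returns the starting profile again, together with the cost pairs 138/134, 96/94, 143/138, 300/297, 171/165, 211/207, 231/227 and 304/300.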



2.3 The EQUI policy 

In the EQUI policy, the cost of job $i$ assigned to machine $j$ is given by expression (1). Here
is an alternative formulation for the cost:

$$c_i = \sum_{\substack{i' : \sigma(i') = j \\ p_{i',j} \le p_{i,j}}} p_{i',j} \; + \sum_{\substack{i' : \sigma(i') = j \\ p_{i',j} > p_{i,j}}} p_{i,j}.$$

Lemma 2 The game with the EQUI policy is an exact potential game.

Proof: If in a game every better response move strictly decreased the total load, the
game would converge. Unfortunately the game does not have this property. Also, if a better
response move never increased the individual costs of players, again the game would
converge, since the total individual cost would measure convergence. It happens that the
game does not have this property either. It turns out that a measure for the convergence
is in fact the average of the two measures above: the sum over all individual costs and the total
load over all machines.

Let $\sigma$ be the current strategy profile, meaning $\sigma(i)$ is the current machine on which job
$i$ is scheduled. Consider the following potential function.

$$\Phi(\sigma) := \frac{1}{2} \sum_{i=1}^{n} \left( c_i + p_{i,\sigma(i)} \right).$$
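We prove that if a job makes a better response move then the potential function strictly
decreases. Let $t$ be a job that moves from machine $a$ to $b$, while strictly decreasing its cost
from $c_t$ to $c'_t$. We have

$$c_t = \sum_{\substack{i : \sigma(i) = a,\ i \neq t \\ p_{i,a} \le p_{t,a}}} p_{i,a} + \sum_{\substack{i : \sigma(i) = a,\ i \neq t \\ p_{i,a} > p_{t,a}}} p_{t,a} + p_{t,a}
\;>\;
\sum_{\substack{i : \sigma(i) = b,\ i \neq t \\ p_{i,b} \le p_{t,b}}} p_{i,b} + \sum_{\substack{i : \sigma(i) = b,\ i \neq t \\ p_{i,b} > p_{t,b}}} p_{t,b} + p_{t,b} = c'_t.$$

Let $\sigma'$ be the strategy profile after the move of job $t$. Note that in $\sigma'$ the processing time of
all jobs except $t$ and the cost of all jobs scheduled on machines different from $a$ and $b$ stay the
same. Thus, the change in the potential depends only on the jobs scheduled on machines $a$
and $b$.

$$\begin{aligned}
2 \cdot \Delta\Phi
&= \Big( \sum_{i : \sigma'(i) = a} (c'_i + p_{i,a}) + \sum_{i : \sigma'(i) = b,\ i \neq t} (c'_i + p_{i,b}) + p_{t,b} \Big) \\
&\quad - \Big( \sum_{i : \sigma(i) = a,\ i \neq t} (c_i + p_{i,a}) + \sum_{i : \sigma(i) = b} (c_i + p_{i,b}) + p_{t,a} \Big) + (c'_t - c_t) \\
&= \sum_{i : \sigma(i) = a,\ i \neq t} (c'_i - c_i) + \sum_{i : \sigma(i) = b,\ i \neq t} (c'_i - c_i) + (c'_t - c_t) + p_{t,b} - p_{t,a}
\end{aligned}$$

since $\sigma(i) = \sigma'(i)$ for all $i \neq t$.

Consider a job $i \neq t$ on machine $a$. If the processing time of $i$ is at most that of $t$ then
the difference between its new and old cost is exactly $-p_{i,a}$. Otherwise if the processing time
of $i$ is strictly greater than that of $t$ then this difference is exactly $-p_{t,a}$. Analogously for
jobs on machine $b$. Hence,

$$\begin{aligned}
2 \cdot \Delta\Phi
&= \Big( \sum_{\substack{i : \sigma(i) = b,\ i \neq t \\ p_{i,b} \le p_{t,b}}} p_{i,b} + \sum_{\substack{i : \sigma(i) = b,\ i \neq t \\ p_{i,b} > p_{t,b}}} p_{t,b} + p_{t,b} \Big)
 - \Big( \sum_{\substack{i : \sigma(i) = a,\ i \neq t \\ p_{i,a} \le p_{t,a}}} p_{i,a} + \sum_{\substack{i : \sigma(i) = a,\ i \neq t \\ p_{i,a} > p_{t,a}}} p_{t,a} + p_{t,a} \Big) + (c'_t - c_t) \\
&= 2 \cdot (c'_t - c_t) < 0.
\end{aligned}$$

Therefore, the game with the EQUI policy is an exact potential game. □

Now we strengthen the statement of the previous lemma.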



Theorem 3 The game with the EQUI policy is a strong potential game, in the sense that
the better-response dynamic converges even with deviations of coalitions.

Proof: Let $S$ be a coalition and define its total cost $c(S) := \sum_{i \in S} c_i$. We study a better
response move of $S$ by dividing the process into two phases: in the first phase, all jobs in
$S$ move out (disappear) from the game and in the second phase, jobs from $S$ move back
(appear) into the game at their new strategies. We argue that after the first phase, the
change in the potential is $\Delta\Phi = -c(S)$ and after the second phase $\Delta\Phi = c'(S)$. Since the
argument is the same, we only prove it for the first phase; the second phase can be done
similarly. Fix a machine $a$ and suppose without loss of generality that all the jobs assigned
to $a$ are $1, \dots, k$ for some $k$. Also to simplify notation we denote $q_i = p_{i,a}$ and assume
$q_1 \le \dots \le q_k$. Let $R = S \cap \sigma^{-1}(a) = \{i_1 < \dots < i_r\}$ be the set of jobs in the coalition that
are scheduled on this machine. Then,

$$c(R) = \sum_{j=1}^{r} c_{i_j} = \sum_{j=1}^{r} \left( q_1 + q_2 + \dots + q_{i_j - 1} + (k - i_j + 1)\, q_{i_j} \right).$$

The jobs in $R$ partition the jobs $\{1, \dots, k\}$ into $r + 1$ parts: part $j \in \{0, \dots, r\}$ is
$[i_j + 1, i_{j+1}]$, where for convenience we denote $i_0 = 0$ and $i_{r+1} = k$. After the move out of
$R$, the change in cost of a job $t \notin R$ scheduled on the machine with index in $[i_j + 1, i_{j+1}]$ is
$q_{i_1} + q_{i_2} + \dots + q_{i_j} + (r - j)\, q_t$. Hence, the difference in the potential restricted to machine
$a$ after the first phase, $\Delta\Phi|_a$, satisfies:

$$\begin{aligned}
-2 \Delta\Phi|_a
&= \sum_{j=0}^{r} \sum_{\substack{t \notin R \\ t \in [i_j + 1,\, i_{j+1}]}} \left( q_{i_1} + q_{i_2} + \dots + q_{i_j} + (r - j)\, q_t \right)
 + \left[ c(R) + (q_{i_1} + q_{i_2} + \dots + q_{i_r}) \right] \\
&= \sum_{j=1}^{r} \left( q_1 + q_2 + \dots + q_{i_j - 1} + (k - i_j + 1)\, q_{i_j} \right) + c(R) \\
&= 2 \cdot c(R)
\end{aligned}$$

where in the first term of these equalities, we distinguish between the cost change of all jobs
not in the coalition and the cost change of the jobs in the coalition, disappearing from the
game. The potential change after the first phase is simply the sum of all the changes over
all machines, so $\Delta\Phi = -c(S)$.

By the same argument, after the second phase we have $\Delta\Phi = c'(S)$. Therefore, the net
change over both phases is $c'(S) - c(S)$. In conclusion, the game is a strong potential game.

□
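
As a numerical sanity check of the potential used in this section, the sketch below tests the single-job statement of Lemma 2 only: with $\Phi(\sigma) = \frac{1}{2} \sum_i (c_i + p_{i,\sigma(i)})$, moving one job $t$ changes $2\Phi$ by exactly $2(c'_t - c_t)$. It works on random unrelated-machine instances with integer processing times; the function names and parameter ranges are assumptions made for this illustration, and coalition moves are not examined.

```python
import random


def equi_cost(i, sigma, p):
    """EQUI cost of job i: sum of min(p[i][j], p[k][j]) over jobs k on its machine j."""
    j = sigma[i]
    return sum(min(p[i][j], p[k][j]) for k in range(len(sigma)) if sigma[k] == j)


def twice_potential(sigma, p):
    """2 * Phi(sigma) = sum_i (c_i + p_{i, sigma(i)}), kept in integers."""
    return sum(equi_cost(i, sigma, p) + p[i][sigma[i]] for i in range(len(sigma)))


def check_exact_potential(trials=200, seed=0):
    """Check 2 * delta(Phi) == 2 * delta(cost of the mover) on random single-job moves."""
    rng = random.Random(seed)
    checked = 0
    for _ in range(trials):
        n, m = rng.randint(2, 6), rng.randint(2, 4)
        p = [[rng.randint(1, 20) for _ in range(m)] for _ in range(n)]
        sigma = [rng.randrange(m) for _ in range(n)]
        t = rng.randrange(n)                                   # the moving job
        b = rng.choice([j for j in range(m) if j != sigma[t]])  # its new machine
        new = list(sigma)
        new[t] = b
        delta_cost = equi_cost(t, new, p) - equi_cost(t, sigma, p)
        delta_2phi = twice_potential(new, p) - twice_potential(sigma, p)
        assert delta_2phi == 2 * delta_cost
        checked += 1
    return checked
```

The identity is checked for every sampled move, improving or not, since an exact potential tracks the mover's cost change in both directions.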

3 Inefficiency of Equilibria under the EQUI policy 

In this section, we study the inefficiency of the game under the EQUI policy which is captured 
by the price of anarchy (PoA) and the strong price of anarchy (SPoA). Note that the set 
of strong Nash equilibria is a subset of that of Nash equilibria so the SPoA is at most as 
large as the PoA. We state the main theorem of this section. Whenever we bound (S)PoA 
we mean that the bound applies to both the price of anarchy and the strong price of anarchy. 

Summary of results on the price of anarchy: The game under the EQUI policy has
the following inefficiency.

1. For identical machines, the (S)PoA is $2 - \frac{1}{m}$.

2. For uniform machines, the (S)PoA is $\Theta(\min\{\log m, r\})$ where $r$ is the number of
different machine speeds in the model.

3. For restricted identical machines, the (S)PoA is $\Theta(\log m)$.

4. For unrelated machines, the (S)PoA is $\Theta(m)$.

We first give a characterization for strong Nash equilibrium in the game, which connects
the equilibria to the strong ones. This characterization is useful in settling tight bounds of
the strong price of anarchy in the game.

Lemma 3 Suppose in a Nash equilibrium there is a coalition T that makes a collective move 
such that each job in T improves strictly its cost. Then this move preserves the number of 

jobs on every machine. 

Proof: For a proof by contradiction, let rj be an equilibrium that is not strong, and let T be 
a coalition as stated in the claim. Suppose that the number of jobs on the machines is not 
preserved by the move of T. Let j be a machine that has strictly more jobs after the move, 
and among all jobs migrating to j, let o e T be the job with smallest length po- Let k and k' 
be the numbers of jobs on j before and after the move of T, respectively {k' > k). We claim 
that job could already improve its cost by unilaterally moving to j, contradicting that rj is 
a Nash equilibrium. Consider equilibrium 77, if o moves to machine j, its cost would be: 

Co ^ {k + 1- w)po,j + ^ Pi,j 

^ {k - w + l)po,j + Yl Yl P^'^ 

i-Pij <Po,j ji^T i'-PiJ <Po,j ,i&T 
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where w is the number of jobs on machine j in r] with length strictly less than po^j. 

Let w' be the number of jobs on machine j after the move of T with length strictly less 
than poj. Since o has the smallest length among all jobs migrating to j, w' < w. The cost 
of o after the move of T is: 

c'o = {k' - w')po,j + Yl P^d 

i-Pi,j<Po,jMT 

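We have:

\[
\begin{aligned}
c'_o - c_o &= \bigl[(k' - w') - (k - w + 1)\bigr]\, p_{o,j} - \sum_{i :\, p_{i,j} < p_{o,j},\, i \in T} p_{i,j} \\
           &\ge (w - w')\, p_{o,j} - \sum_{i :\, p_{i,j} < p_{o,j},\, i \in T} p_{i,j} \\
           &\ge (w - w')\, p_{o,j} - (w - w')\, p_{o,j} = 0,
\end{aligned}
\]

where the first inequality follows from $k' \ge k + 1$ and the second inequality uses
$|\{i : p_{i,j} < p_{o,j},\ i \in T\}| = w - w'$. Hence the cost of $o$ after a unilateral move to $j$ is at most
its cost after the move of the coalition. Since job $o$ has an incentive to cooperate and move to
machine $j$, $o$ would also be better off by unilaterally changing its strategy, so $\eta$ is not an equilibrium. □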
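
The cost expression used in the proof above can be sketched in a few lines of Python: under EQUI, a job pays in full for every strictly shorter job on its machine, and pays its own length once for every job that is at least as long (itself included). This is only a minimal illustration; the helper name `equi_costs` and the `speed` parameter (a machine of speed $s$ is assumed to process a job of length $p$ in $p/s$ time units) are hypothetical and not taken from the paper.

```python
from fractions import Fraction

def equi_costs(lengths, speed=1):
    """Completion time of every job sharing one machine under EQUI.

    Assumption: a job of length p needs p / speed units of processing,
    and all unfinished jobs receive an equal share of the machine.
    """
    times = [Fraction(p) / Fraction(speed) for p in lengths]
    costs = []
    for p in times:
        shorter = sum(q for q in times if q < p)          # finished before this job
        not_shorter = sum(1 for q in times if q >= p)     # still present when it finishes
        costs.append(shorter + not_shorter * p)
    return costs

# Three jobs of lengths 1, 2 and 4 on a unit-speed machine.
print(equi_costs([1, 2, 4]))
```

With lengths 1, 2, 4 the shortest job finishes once each of the three jobs has received one unit, and the longest job always finishes at the total load of the machine.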

3.1 Identical machines 

In the case of identical machines, the analysis of the PoA is quite similar to the well-known
analysis of Graham's greedy load balancing algorithm, which processes the jobs in arbitrary
order and assigns each job to the least loaded machine; see [23]. Here we show that the (S)PoA
matches exactly the approximation factor of the greedy algorithm.

Proposition 1 For identical machines, the (S)PoA is $2 - 1/m$. Moreover, there is an instance
in which all equilibria have cost at least $(2 - 2/m)\,OPT$.

Proof: (Upper bound) First we prove that the PoA is upper-bounded by $2 - 1/m$. Let $\sigma$ be
an equilibrium and $\ell_{\max}$ be the makespan of this equilibrium. Let $i$ be a job (with processing
time $p_i$) that has cost $\ell_{\max}$. Clearly, $p_i \le OPT$. Since $\sigma$ is an equilibrium, the fact that job
$i$ has no incentive to move to any other machine $j$ implies $\ell_{\max} \le \ell_j + p_i$ for all machines
$j$ different from $\sigma(i)$, where $\ell_j$ is the load of machine $j$. Summing up these inequalities over
all machines $j \ne \sigma(i)$ and adding $\ell_{\max} = \ell_{\sigma(i)}$, we get $m\,\ell_{\max} \le \sum_{j=1}^{m} \ell_j + (m - 1)\,p_i$.
Moreover, for any assignment of jobs to identical machines, $\sum_{j=1}^{m} \ell_j \le m \cdot OPT$. Therefore,
$m\,\ell_{\max} \le (2m - 1)\,OPT$, i.e., $PoA \le 2 - 1/m$.

(Lower bound) Now we give an instance in which OPT equals $m$ and all equilibria
have cost at least $2m - 2$. In the instance, there are $m$ machines and $m(m - 1) + 1$ jobs, all of
which have processing time 1 except one with processing time $m$. In an optimum
assignment, the big job is scheduled on one machine and all $m(m - 1)$ unit jobs are evenly
assigned to the other machines, producing makespan $m$. We claim that in any equilibrium,
every machine has at least $m - 1$ jobs. Suppose there is a machine with at most $m - 2$
jobs. Since there are $m(m - 1)$ jobs of unit processing time, there must be a machine $j$ with
at least $m$ unit jobs in the equilibrium. A unit job on machine $j$ has cost at least $m$ and
it has an incentive to move to the machine with at most $m - 2$ jobs and get a smaller cost
(at most $m - 1$). This gives a contradiction and shows that in any equilibrium, every machine
has at least $m - 1$ jobs. Now consider the machine with the big job. In addition to the big job,
this machine has at least $m - 2$ unit jobs, so its load is at least $m + (m - 2)$. Therefore, the
makespan of the equilibrium is at least $(2 - 2/m)\,OPT$.

Consider the schedule in which there are $m - 1$ unit jobs on every machine and the
job with processing time $m$ on some arbitrary machine. It is straightforward to check that this
is an equilibrium. By Lemma 3, this equilibrium is also a strong one. Hence, the (S)PoA is at
least $2 - 1/m$. □
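
The equilibrium used for the bound $2 - 1/m$ can be checked mechanically for small $m$. The sketch below builds the profile with $m - 1$ unit jobs on every machine plus the job of length $m$ on the first machine, and tests that no single job can strictly lower its EQUI completion time by switching machines. The names `equi_cost`, `is_nash` and `lower_bound_profile` are hypothetical helpers, and the value $OPT = m$ is taken from the text rather than computed.

```python
from fractions import Fraction

def equi_cost(p, others):
    """EQUI completion time of a job of length p that shares a unit-speed machine with `others`."""
    shorter = sum(q for q in others if q < p)
    not_shorter = 1 + sum(1 for q in others if q >= p)
    return shorter + not_shorter * p

def is_nash(machines):
    """machines: list of lists of job lengths on identical machines."""
    for a, jobs in enumerate(machines):
        for idx, p in enumerate(jobs):
            current = equi_cost(p, jobs[:idx] + jobs[idx + 1:])
            for b, target in enumerate(machines):
                # A deviation counts only if it strictly improves the job's cost.
                if b != a and equi_cost(p, target) < current:
                    return False
    return True

def lower_bound_profile(m):
    """m - 1 unit jobs on every machine, plus the job of length m on machine 0."""
    machines = [[1] * (m - 1) for _ in range(m)]
    machines[0].append(m)
    return machines

m = 5
profile = lower_bound_profile(m)
makespan = max(sum(jobs) for jobs in profile)
# Ratio against OPT = m (value stated in the text, not recomputed here).
print(is_nash(profile), makespan, Fraction(makespan, m))
```

The check covers unilateral deviations only; the strong-equilibrium property still relies on Lemma 3.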



3.2 Uniform machines 

For uniform machines, an upper bound of $O(\log m)$ on the PoA of any deterministic policy in
this machine environment is proved by Immorlica et al. [26]. In this section, we investigate
the lower bound and show that the bound $O(\log m)$ is essentially tight.

In the following, we present a family of game instances in which the PoA, together with
the SPoA, are $\Omega(\log m)$. The instances are inspired by the ones proving the lower bound on
the competitive ratio of the greedy algorithm for uniform machines in [2].



Family of Game Instances. There are $k + 1$ groups of machines $G_0, G_1, \ldots, G_k$; each
machine in group $G_j$ has speed $2^{-j}$ for $0 \le j \le k$. Group $G_0$ has $m_0 = 1$ machine, and group $G_j$
has $m_j$ machines, where $m_j$ is recursively defined as $m_j = \sum_{t=0}^{j-1} m_t \cdot 2^{j-t}$. Moreover, there are
$k + 1$ groups of jobs $J_0, J_1, \ldots, J_k$: for $0 \le j \le k - 1$, each group $J_j$ consists of $2m_j$ jobs of
length $2^{-j}$, and group $J_k$ consists of $3m_k$ jobs of length $2^{-k}$. The total number of machines
is $m = \sum_{j=0}^{k} m_j = 1 + \frac{2}{3}(4^{k-1} - 1) + 2 \cdot 4^{k-1}$, thus $k = \Omega(\log m)$. See Figure 2
for illustration.

Consider a schedule that is a two-to-one mapping from the job group $J_j$ to the machine
group $G_j$, for every $j < k$, and that is a three-to-one mapping from job group $J_k$ to machine
group $G_k$. The load on a machine in group $G_j$ for $j < k$ is 2 and each machine in $G_k$ has
load 3. Hence, $OPT \le 3$.

Figure 2: Illustration of the schedule with makespan 3 (upper part) and the strategy profile
$\sigma$ (lower part) in the game instance for $k = 3$. Each machine group is represented by one of
its machines.

Consider a schedule (strategy profile) $\sigma$ such that for every $0 \le j \le k$, in each machine
of group $G_j$ (with speed $2^{-j}$), there are 2 jobs of length $2^{-j}$, and for every $j < i \le k$,
there are $2^{i-j}$ jobs of length $2^{-i}$. Each machine in group $G_j$ has $2^{k-j+1}$ jobs and has load
$2 \cdot 2^{-j}/2^{-j} + \sum_{i=1}^{k-j} 2^{i} \cdot 2^{-(j+i)}/2^{-j} = k - j + 2$, so the makespan of this schedule is $k + 2$. We
claim that this strategy profile is a Nash equilibrium; moreover, it is a strong one.

Lemma 4 The strategy profile $\sigma$ is a Nash equilibrium.

Proof: First, we show that, in strategy profile $\sigma$, the cost of a job in $J_j$ is equal to $k - j + 2$.
Fix a machine in $G_t$. We are only interested in the case $t \le j$ since in $\sigma$, no job in $J_j$ is assigned
to a machine of group $G_t$ with $t > j$. On this machine, there are exactly $2^{j-t+1}$ jobs with
processing time at least $2^{-j}$. So, the cost of a job in $J_j$ scheduled on this machine is:

\[
\frac{1}{2^{-t}} \Bigl[ \bigl( 2^{k-t} \cdot 2^{-k} + 2^{(k-1)-t} \cdot 2^{-(k-1)} + \ldots + 2^{(j+1)-t} \cdot 2^{-(j+1)} \bigr) + 2^{j-t+1} \cdot 2^{-j} \Bigr] = k + 2 - j.
\]

Now we argue that $\sigma$ is an equilibrium. Suppose that a job $i$ in $J_j$ moves from its
current machine to a machine of group $G_t$. If $j < t$, $i$ has the greatest length among all jobs
assigned to this new machine, so the new cost of $i$ is the new load of the machine, which is
$(k - t + 2) + 2^{-j}/2^{-t} > k - j + 2$. If $j \ge t$, then there are $2^{j-t+1} + 1$ jobs with length at
least $2^{-j}$ on $i$'s new machine. Hence, the new cost of $i$ is:

\[
\frac{1}{2^{-t}} \Bigl[ \bigl( 2^{k-t} \cdot 2^{-k} + 2^{(k-1)-t} \cdot 2^{-(k-1)} + \ldots + 2^{(j+1)-t} \cdot 2^{-(j+1)} \bigr) + (2^{j-t+1} + 1) \cdot 2^{-j} \Bigr] > k + 2 - j.
\]

Therefore, no job can improve its cost by changing its strategy. □
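
The two computations in the proof above can be replayed numerically for small $k$. The following sketch rebuilds the jobs placed on one machine of each group $G_t$ in $\sigma$, checks that a job of length $2^{-j}$ has cost exactly $k - j + 2$, and checks that moving it to a machine of any group leaves it with a strictly larger cost. The helper names (`equi_cost`, `machine_jobs`, `check_sigma`) are hypothetical, and the target machine is assumed to hold exactly the jobs prescribed by $\sigma$.

```python
from fractions import Fraction

def equi_cost(p, others, speed):
    """EQUI completion time of a job of length p on a machine of the given speed."""
    shorter = sum(q for q in others if q < p)
    not_shorter = 1 + sum(1 for q in others if q >= p)
    return (shorter + not_shorter * p) / speed

def machine_jobs(t, k):
    """Job lengths on one machine of group G_t in sigma: 2 jobs of length 2^-t,
    and 2^(i-t) jobs of length 2^-i for every t < i <= k."""
    jobs = [Fraction(1, 2**t)] * 2
    for i in range(t + 1, k + 1):
        jobs += [Fraction(1, 2**i)] * 2**(i - t)
    return jobs

def check_sigma(k):
    for t in range(k + 1):
        speed = Fraction(1, 2**t)
        jobs = machine_jobs(t, k)
        for idx, p in enumerate(jobs):
            j = p.denominator.bit_length() - 1       # p == 2^-j
            cost = equi_cost(p, jobs[:idx] + jobs[idx + 1:], speed)
            if cost != k - j + 2:
                return False
            for s in range(k + 1):                   # move to a machine of group G_s
                if equi_cost(p, machine_jobs(s, k), Fraction(1, 2**s)) <= cost:
                    return False
    return True

print(check_sigma(3))
```

Exact rational arithmetic is used so that the equality test against $k - j + 2$ is not affected by rounding.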
Using Lemma 3, we show that $\sigma$ is indeed a strong equilibrium.

Lemma 5 Strategy profile $\sigma$ is a strong Nash equilibrium.

Proof: Suppose $\sigma$ is not a strong Nash equilibrium. Then there exists a coalition $T$ such that
all jobs in $T$ strictly decrease their costs and after the move of $T$, all machines have the same
number of jobs as in $\sigma$ (by Lemma 3). Observe that the cost of a job in $J_k$ (with the least
length among all jobs in the instance) depends only on the number of jobs scheduled on its
machine. With such a move of $T$, if there are some jobs in $J_k$ involved in the coalition, none
of them can strictly decrease its cost. Hence, $T \cap J_k = \emptyset$. Consider jobs in $J_{k-1}$. Since jobs
in $J_k$ stay on their machines and they incur the same load 1 on each machine, the cost of a
job in $J_{k-1}$, if it takes part in $T$, depends only on the number of jobs which are not in $J_k$ and
are scheduled on its new machine. However, this number is preserved after the move of $T$
(by Lemma 3 and $T \cap J_k = \emptyset$), so the cost of a job in $J_{k-1}$ stays the same, i.e., the job has
no incentive to take part in $T$. The same argument holds for the job groups $J_{k-2}, \ldots, J_0$.
Therefore, $T = \emptyset$, meaning that $\sigma$ is a strong equilibrium. □

The previous lemmas imply that for uniform machines, the PoA of EQUI is $\Omega(\log m)$.

Theorem 4 For uniform machines, the (S)PoA of EQUI is $\Theta(\log m)$.

3.3 Restricted Identical Machines

The upper bound on the price of anarchy of EQUI on restricted identical machines follows
immediately from Immorlica et al. [26]. In [26], an instance was given which shows that any
deterministic non-preemptive coordination mechanism has PoA $\Omega(\log m)$. However, EQUI is
a preemptive policy, and the instance cannot be adapted. In this section, we show that the
price of anarchy of the EQUI policy on restricted identical machines is also $\Omega(\log m)$ using
another instance.

Theorem 5 For restricted identical machines, the (S)PoA is $\Theta(\log m)$.

Proof: The upper bound follows from [26]. We now show the lower bound. We adapt a game
instance from the proof of Lemma 5. Let $(m_j)_{j=0}^{k}$ be a sequence defined as $m_0 = 1$, $m_1 = 2$
and $m_j = m_0 + \ldots + m_{j-1}$, i.e., $m_j = 3 \cdot 2^{j-2}$ for every $j \ge 2$. Let $m = \sum_{j=0}^{k} m_j = 3 \cdot 2^{k-1}$.
Hence $k = \Omega(\log m)$.

In the instance, there are $m$ machines which are divided into $k + 1$ groups $G_0, \ldots, G_k$,
where group $G_j$ consists of $m_j$ machines. There are also $k + 1$ job groups $J_0, J_1, \ldots, J_k$, where
group $J_j$ contains $3 \cdot 2^{j} m_j$ jobs of processing time $2^{-j}$.

We first describe a schedule $\mu$ which will be proved to be a Nash equilibrium. On each
machine in group $G_j$ for $0 \le j \le k$, there are $2^{j+1}$ jobs of length $2^{-j}$ and for every $j < i \le k$,
there are $2^{i}$ jobs of length $2^{-i}$. The strategy set of each job is the following. Jobs in group $J_j$
can be scheduled on all $m_j$ machines of group $G_j$. Moreover, a job in $J_j$ can additionally be
scheduled on its current machine in $\mu$.

We claim that $\mu$ is an equilibrium. Observe that on each machine of group $G_j$, there are
exactly $2^{i+1}$ jobs of processing time at least $2^{-i}$ for all $j \le i \le k$, and the total load of jobs
with processing time strictly smaller than $2^{-i}$ (on the machine) is $k - i$. Thus, the cost of
each job in group $J_i$ is $k - i + 2$ in $\mu$, and if such a job switches its strategy, its cost would be
strictly greater than $k - i + 2$. In addition, using Lemma 3 and by the same argument as in
Lemma 5, we have that this equilibrium is indeed a strong one.

If we schedule evenly all jobs of group $J_j$ on the $m_j$ machines of $G_j$ for $0 \le j \le k$, then the
makespan is bounded by 3, so $OPT \le 3$. The makespan of the strong equilibrium above is
at least $k + 1$, so the (S)PoA is at least $(k + 1)/3 = \Omega(\log m)$. □
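
The per-machine counting argument for $\mu$ can also be replayed numerically. The sketch below assumes unit-speed machines, rebuilds the jobs on one machine of each group $G_j$, and checks that a job of length $2^{-i}$ has cost $k - i + 2$ and that its only alternative strategy, a machine of its own group $G_i$, is strictly more expensive. The helper names `mu_machine` and `check_mu` are hypothetical; the sketch does not depend on the group sizes $m_j$.

```python
from fractions import Fraction

def equi_cost(p, others):
    """EQUI completion time of a job of length p that shares a unit-speed machine with `others`."""
    shorter = sum(q for q in others if q < p)
    not_shorter = 1 + sum(1 for q in others if q >= p)
    return shorter + not_shorter * p

def mu_machine(j, k):
    """Job lengths on one machine of group G_j in mu: 2^(j+1) jobs of length 2^-j,
    and 2^i jobs of length 2^-i for every j < i <= k."""
    jobs = [Fraction(1, 2**j)] * 2**(j + 1)
    for i in range(j + 1, k + 1):
        jobs += [Fraction(1, 2**i)] * 2**i
    return jobs

def check_mu(k):
    for j in range(k + 1):
        jobs = mu_machine(j, k)
        for idx, p in enumerate(jobs):
            i = p.denominator.bit_length() - 1       # p == 2^-i
            cost = equi_cost(p, jobs[:idx] + jobs[idx + 1:])
            if cost != k - i + 2:
                return False
            # Only allowed deviation: a machine of the job's own group G_i.
            if equi_cost(p, mu_machine(i, k)) <= cost:
                return False
    return True

print(check_mu(4))
```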

3.4 Unrelated Machines 

In this section, we prove that the PoA of the game under the EQUI policy is upper bounded
by $2m$. Interestingly, without any knowledge of the jobs' characteristics, the inefficiency of EQUI,
a non-clairvoyant policy, is the same up to a constant factor as that of SPT, the
best strongly local policy, which has price of anarchy $\Theta(m)$.

Theorem 6 For unrelated machines, the price of anarchy of policy EQUI is at most 2m. 

Proof: For job $i$, let $q_i$ be the smallest processing time of $i$ among all machines, i.e.,
$q_i := \min_j p_{i,j}$, and let $Q(i)$ be the machine $j$ minimizing $p_{i,j}$. Without loss of generality we assume
that jobs are indexed such that $q_1 \le q_2 \le \ldots \le q_n$. Note that $\sum_{i=1}^{n} q_i \le m \cdot OPT$, where
OPT is the optimal makespan, as usual. We claim that in any Nash equilibrium, the cost $c_i$ of
job $i$ is at most

\[
2q_1 + \ldots + 2q_{i-1} + (n - i + 1)\, q_i. \tag{4}
\]

The theorem would follow from the claim by the following argument. Since the expression
(4) is increasing in $i$ and at $i = n$ it is at most $2\sum_{i=1}^{n} q_i \le 2m \cdot OPT$, the cost of each job in
an equilibrium is bounded by $2m \cdot OPT$, so the price of anarchy is at most $2m$.

The proof of the claim is by induction on $i$. The cost of job 1 on machine $Q(1)$ would be
at most $nq_1$, simply because there are at most $n$ jobs on this machine. Therefore the cost of
job 1 in the Nash equilibrium is also at most $nq_1$. Assume the induction hypothesis holds
up to index $i - 1$. Consider job $i$. Since the strategy profile is a Nash equilibrium, $i$'s current
cost is at most its cost if moving to machine $Q(i)$. We distinguish different cases. In these
cases, denote by $c'_i$ the new cost of $i$ if it moves to machine $Q(i)$.

1. Case all jobs $t$ scheduled on machine $Q(i)$ satisfy $t > i$.

This case is very similar to the base case. There are at most $n - i$ jobs on machine
$Q(i)$, besides $i$. The completion time of job $i$ is then at most $(n - i + 1)\,q_i$, which is
upper bounded by (4). For the remaining cases, we assume that there is a job $i' < i$
scheduled on $Q(i)$.

2. Case there is a job $t < i$ on machine $Q(i)$ such that $p_{t,Q(i)} \ge p_{i,Q(i)}$ $(= q_i)$.

Since $p_{t,Q(i)} \ge q_i$, the new cost of job $i$ is not more than the new cost of job $t$. Moreover,
the new cost of job $t$ is increased by exactly $q_i$, so the new cost of $i$ is bounded by

\[
\begin{aligned}
c'_i &\le c_t + q_i \\
     &\le 2q_1 + \ldots + 2q_{t-1} + (n - t + 1)\, q_t + q_i \\
     &= 2q_1 + \ldots + 2q_{t-1} + 2(i - t)\, q_t + (n - 2i + t + 1)\, q_t + q_i \\
     &\le 2q_1 + \ldots + 2q_{t-1} + 2q_t + \ldots + 2q_{i-1} + (n - i + 1)\, q_i,
\end{aligned}
\]

where the second inequality uses the induction hypothesis and the last inequality is due
to $t < i$ and $q_t \le q_{t+1} \le \ldots \le q_i$.

3. Case every job $t$ scheduled on machine $Q(i)$ with $p_{t,Q(i)} \ge q_i$ satisfies $t > i$.

Since we are not in the first two cases, there is a job $t < i$ on machine $Q(i)$ with
$p_{t,Q(i)} < q_i$. Let $i'$ be the job of greatest index among all jobs scheduled on $Q(i)$
with smaller processing time than $q_i$. All jobs $t$ scheduled on $Q(i)$ and having smaller
processing time than that of $i$ also have smaller index, because $q_t \le p_{t,Q(i)} < q_i$.
Therefore $i'$ is precisely the last job to complete before $i$. At the completion time of $i'$
there are still $q_i - p_{i',Q(i)} \le q_i - q_{i'}$ units of $i$ to be processed. By the case assumption,
there are at most $n - i$ jobs with processing time greater than that of $i$. Therefore
the new cost of $i$ is at most

\[
\begin{aligned}
c'_i &= c_{i'} + (n - i + 1)(q_i - q_{i'}) \\
     &\le 2q_1 + \ldots + 2q_{i'-1} + (n - i' + 1)\, q_{i'} + (n - i + 1)(q_i - q_{i'}) \\
     &= 2q_1 + \ldots + 2q_{i'-1} + (i - i')\, q_{i'} + (n - i + 1)\, q_i \\
     &\le 2q_1 + \ldots + 2q_{i'-1} + (q_{i'} + \ldots + q_{i-1}) + (n - i + 1)\, q_i \\
     &\le 2q_1 + \ldots + 2q_{i-1} + (n - i + 1)\, q_i,
\end{aligned}
\]

where the first inequality uses the induction hypothesis and the second inequality is due
to the monotonicity of the sequence $(q_j)_{j=1}^{n}$.

This completes the proof of the claim, and therefore of the theorem. □ 
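
The two properties of expression (4) used to derive the theorem from the claim (it is nondecreasing in $i$, and its value at $i = n$ is at most $2\sum_i q_i$) can be illustrated on concrete numbers. The sketch below evaluates (4) for a sorted list of values $q_1 \le \ldots \le q_n$; the function name `claim_bounds` and the sample values are hypothetical.

```python
def claim_bounds(q):
    """Value of expression (4) for i = 1..n, given q_1 <= ... <= q_n as a 0-indexed list."""
    n = len(q)
    bounds = []
    prefix = 0                      # q_1 + ... + q_{i-1}
    for idx, qi in enumerate(q):
        i = idx + 1
        bounds.append(2 * prefix + (n - i + 1) * qi)
        prefix += qi
    return bounds

q = [1, 2, 2, 5]
bounds = claim_bounds(q)
print(bounds)
# Consecutive values differ by q_i + (n - i) * (q_{i+1} - q_i) >= 0.
print(all(a <= b for a, b in zip(bounds, bounds[1:])))
# At i = n the expression equals 2 * sum(q) - q_n.
print(bounds[-1] == 2 * sum(q) - q[-1])
```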

We provide a game instance showing that the upper bound analyzed above is tight up to a
constant factor. The instance is inspired by the work of Azar et al. [6]. In the following lemma,
we prove the lower bound on the PoA of the game under the EQUI policy.

Lemma 6 The (strong) price of anarchy of EQUI is at least (m + l)/4. 

Proof: Let $n_j := \frac{2(m-1)!}{(j-1)!}$ and $n := \sum_{j=1}^{m} n_j$. Consider the set of $m$ machines and $m$ groups
of jobs $J_1, J_2, \ldots, J_m$. In group $J_j$ ($1 \le j \le m - 1$), there are $n_j$ jobs that can be scheduled
on machine $j$ or $j + 1$, except the last group ($J_m$), which has a single job that can only be
scheduled on machine $m$. Each job in group $J_j$ ($1 \le j \le m - 1$) has processing time
$p_{j,j} = \frac{(j-1)!}{(m-1)!} = \frac{2}{n_j}$ on machine $j$ and has processing time $p_{j,j+1} = \frac{j!}{2(m-1)!} = \frac{j}{n_j}$ on machine
$j + 1$. The job in $J_m$ has processing time $p_{m,m} = 1$ on machine $m$.

Consider the strategy profile in which half of the jobs in $J_j$ ($1 \le j \le m - 1$) are
scheduled on machine $j$ and the other half are scheduled on machine $j + 1$ (the job in $J_m$
is scheduled on machine $m$). We claim that this strategy profile is a Nash equilibrium.
Note that the costs of jobs in the same group and scheduled on the same machine are the
same. The cost of each job in group $J_j$ on machine $j$ is the load of the machine, because
its processing time is greater than that of jobs in group $J_{j-1}$ on machine $j$, and this load
equals $\frac{n_{j-1}}{2}\, p_{j-1,j} + \frac{n_j}{2}\, p_{j,j} = \frac{j-1}{2} + 1 = \frac{j+1}{2}$. Each job in group $J_j$ has smaller processing
time than that of each job in group $J_{j+1}$ on machine $j + 1$, thus the cost of the former is
$\frac{n_j + n_{j+1}}{2}\, p_{j,j+1} = \frac{j+1}{2}$. Hence, no job in group $J_j$ ($1 \le j \le m - 1$) has an incentive to move
and the job in group $J_m$ cannot switch its strategy. Therefore, the strategy profile is an
equilibrium.

Moreover, we prove that this equilibrium is indeed a strong one. Suppose that it is not
a strong equilibrium, i.e., there is a coalition $S$ such that all jobs in $S$ can strictly decrease
their cost. Again, by Lemma 3, the number of jobs on each machine remains the same after
the move of $S$. We call a job in group $J_j$ moving up if it moves from machine $j$ to $j + 1$
and moving down if it moves from machine $j + 1$ to $j$. First, we claim that no job has an
incentive to move up. If a job in group $J_j$ moves up, as only jobs in $J_j$ and $J_{j+1}$ can use
machine $j + 1$ and $p_{j,j+1} < p_{j+1,j+1}$, its new cost would be $p_{j,j+1} \cdot (n_j + n_{j+1})/2$, which equals
its old cost. Hence, no one can strictly decrease its cost by moving up. Among all jobs in $S$,
consider the one that moves down to the machine $j^*$ of smallest index. By the choice of $j^*$,
there is no job moving down from machine $j^*$ and, as claimed above, no job moving up from
$j^*$. Hence, the job moving to machine $j^*$ cannot strictly decrease its cost, which contradicts
the assumption that all jobs in $S$ are strictly better off. Therefore, the equilibrium is a
strong one.

Consider a schedule in which the jobs in group $J_j$ ($1 \le j \le m$) are assigned to machine $j$.
This schedule has makespan 2, hence $OPT \le 2$. The makespan of the above (strong)
Nash equilibrium is the load on machine $m$, which is equal to $(m + 1)/2$. Then, the (strong)
price of anarchy is at least $(m + 1)/4$. □
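
For small $m$ the instance of Lemma 6 can be built explicitly and the Nash property checked by enumeration. The sketch below assumes $n_j = 2(m-1)!/(j-1)!$, $p_{j,j} = 2/n_j$ and $p_{j,j+1} = j/n_j$, places half of each group $J_j$ ($j < m$) on machine $j$ and half on machine $j + 1$, and returns the makespan if no job can strictly improve by a unilateral move. The function names are hypothetical; only unilateral deviations are tested, and the bound $OPT \le 2$ is taken from the text.

```python
from fractions import Fraction
from math import factorial

def build_profile(m):
    """proc[g][a]: length of a job of group g on machine a (allowed machines only);
    machines[a]: group labels of the jobs placed on machine a."""
    n = {j: 2 * factorial(m - 1) // factorial(j - 1) for j in range(1, m + 1)}
    proc = {j: {j: Fraction(2, n[j]), j + 1: Fraction(j, n[j])} for j in range(1, m)}
    proc[m] = {m: Fraction(1)}
    machines = {a: [] for a in range(1, m + 1)}
    for j in range(1, m):
        machines[j] += [j] * (n[j] // 2)
        machines[j + 1] += [j] * (n[j] // 2)
    machines[m].append(m)                       # the single job of J_m
    return proc, machines

def equi_cost(p, others):
    shorter = sum(q for q in others if q < p)
    not_shorter = 1 + sum(1 for q in others if q >= p)
    return shorter + not_shorter * p

def equilibrium_makespan(m):
    """Makespan of the profile, or None if some job has an improving deviation."""
    proc, machines = build_profile(m)
    lengths = {a: [proc[g][a] for g in machines[a]] for a in machines}
    for a, groups in machines.items():
        for g in set(groups):                   # one representative per group
            rest = list(lengths[a])
            rest.remove(proc[g][a])
            cost = equi_cost(proc[g][a], rest)
            for b in proc[g]:
                if b != a and equi_cost(proc[g][b], lengths[b]) < cost:
                    return None
    return max(sum(ls) for ls in lengths.values())

print(equilibrium_makespan(4))
```

For $m = 4$ the loads of machines 1 to 4 are $1, 3/2, 2, 5/2$, matching the value $(j + 1)/2$ derived in the proof.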

4 Conclusion and Open questions 

In this paper, we studied coordination mechanisms under non-clairvoyant policies. We first
studied whether some policies are admissible, which is the first property that we expect
from a policy. We studied in detail the existence of Nash equilibria under the RANDOM
and the EQUI policies. Next, we analyzed the inefficiency (PoA) of the EQUI policy and
showed that knowledge of the agents' processing times is not really necessary, since EQUI
behaves nearly as well as the best known strongly local policy, SPT. One more advantage is
that there is no need to implement the EQUI policy separately, since this popular policy
already exists in many operating systems.

An interesting open question is whether the gap in the
PoA between strongly local and local policies can be closed. Does there exist a (preemptive,
randomized) strongly local policy with PoA polylogarithmic in $m$? Azar et al. [6]
proved that with an additional condition, this gap is closed. Can we bypass this condition?
Besides, does there exist a truthful coordination mechanism based on a strongly local policy
with PoA $o(m)$?

Another interesting open problem is the speed of convergence to an approximate Nash
equilibrium for RANDOM and EQUI in the machine environments where an equilibrium is
guaranteed to exist.